Implicit relevance feedback from a multi-step search process: a use of query-logs

نویسندگان

  • Corrado Boscarino
  • Arjen P. de Vries
  • Vera Hollink
  • Jacco van Ossenbruggen
چکیده

We evaluate the use of clickthrough information as implicit relevance feedback in sessions. We employ records of user interactions with a commercial news picture portal: issued queries, clicked images, and purchased content. Our study investigates how much of a session’s search history (if any) should be used in a feedback loop. We assess the benefit of using clicked data as positive tokens of relevance to the task of estimating the probability of an image to be purchased. We find that a short history of past queries helps improve ranking, and that terms derived from clicked documents lead to a much higher effectiveness, while blind relevance feedback is not beneficial for the task. 1 Evidence of user interaction: Query Logs (QL) Logs of queries issued and the subsequent interactions with the query results, briefly referred to as ‘query logs’ (QLs) in this paper, provide a basis to adapt a relevance model to reflect what we have learned about the user’s information need. A set of QLs recorded when subscribers to Belga Picture were searching for images to be purchased online, allows us 1) to investigate how valuable clicks are as source of (implicit) relevance feedback in a multi-step search session and 2) to observe how much search history (if any) may lead to an improvement in the ranking of what we believe to be a determinately relevant document: the picture that a user is known to have purchased at the end of a search session. A QL registers, for each session, three types of user interactions: query submissions (Q), a possibly empty set of clicks (C) on the retrieved results, and, purchases (P); an anonymous identifier labels each step. Previous studies diverge in their findings about how much evidence of user interactions (Q and C) should be used for feedback: Tan et al. report in [5] that long term search history may improve web retrieval, while the authors of [4] argue to emphasize short-term query context. Also, Gong et al. question whether clicked data should be accepted as positive evidence of a document being relevant without a quality 1 A European news agency: http://picture.belga.be/picture-home/index.html, log data collected within the VITALAS project: http://vitalas.ercim.org.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Query expansion based on relevance feedback and latent semantic analysis

Web search engines are one of the most popular tools on the Internet which are widely-used by expert and novice users. Constructing an adequate query which represents the best specification of users’ information need to the search engine is an important concern of web users. Query expansion is a way to reduce this concern and increase user satisfaction. In this paper, a new method of query expa...

متن کامل

Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback

Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...

متن کامل

Learning to Detect Event-Related Queries for Web Search

In many cases, a user turns to search engines to find information about real-world situations, namely, political elections, sport competitions, or natural disasters. Such temporal querying behavior can be observed through a significant number of event-related queries generated in web search. In this paper, we study the task of detecting event-related queries, which is the first step for underst...

متن کامل

Universität des Saarlandes , FR Informatik Max - Planck - Institut für Informatik , AG 5 Query - log based Authority Analysis for Web Information

The ongoing explosion of web information calls for more intelligent and personalized methods towards better search result quality for advanced queries. Query logs and click streams obtained from web browsers or search engines can contribute to better quality by exploiting the collaborative recommendations that are implicitly embedded in this information. The method presented in this work incorp...

متن کامل

Improving Re-ranking of Search Results Using Collaborative Filtering

Search Engines today often return a large volume of results with possibly a few relevant results. The notion of relevance is subjective and depends on the user and the context of search. Re-ranking of these results to reflect the most relevant results to the user, using a user profile built from the relevance feedback has proved to provide good results. Our approach assumes implicit feedback ga...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011